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FIELD OF THE INVENTION 

The present invention relates generally to communication networks, and more 
particularly, to communication networks utilizing link state protocols. 

BACKGROUND OF THE INVENTION 

Communication networks can include various types of protocols that route data 
through the network. One such type of protocol is referred to as a link-state protocol. 
Known link-state protocols include Open Shortest Path First (OSPF), which is used in 
Internet Protocol (IP) networks, and Private Network-Network Interface (PNNI), which 
is used in Asynchronous Transfer Mode (ATM) networks. 

IP and ATM networks are generally organized into one or more areas each of 
which includes a link-state database. Link-state routing protocols rely on the exchange of 
a relatively large number of control messages within each area as the network comes "up," 
i.e., becomes operational. For example, the network nodes send and receive Link State 
Advertisement (LSA) messages in the OSPF protocol and PNNI Topology State Update 
(PTSE) messages in the PNNI protocol for enabling each node to determine the network 
topology. As the (OSPF) network comes up, OSPF LSA messages are flooded 
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throughout a network area. A given node can receive more than one copy of the same 
LSA message in which case the first LS A message is regarded as the original and the other 
LSA messages are regarded as duplicates. An original LSA message is acknowledged 
over the trunk from which it came and copies of the message are flooded over the other 
5 trunks. Duplicate messages are typically discarded after processing. 

Another type of OSPF control message is the HELLO message that is periodically 
exchanged over each trunk connecting neighboring nodes. The HELLO messages are 
used to determine the status of the trunks, i.e., whether a given trunk is up. There are also 
10 some timers which, if expired, result in the generation of control messages. Examples of 
timers include LSA retransmission timers, HELLO refresh timers and LSA refresh timers. 



Generally, link-state routing protocols do not specify the order in which the 
various control messages are to be serviced when more than one message is outstanding at 
15 a network node processor. In accordance with conventional practices, the control 

messages are serviced in a First-Come-First-Served (FCFS) manner. In some instances, 
control messages triggered by the expiry of a timer are serviced at a higher priority than 
other messages without making any further distinctions between the message types. 

20 One disadvantage with such link-state message processing schemes is that certain 

message types may not be timely processed due to network congestion whenever a 
relatively large number of LSA messages is generated within a relatively short time 
interval in the network. Such an event is referred to as an "LSA storm." The network 
congestion can be the result of nodes/trunks going "down" or coming back up. An LSA 

25 storm can be generated due to the failure or recovery of a single trunk, group of trunks, 
single node, or group of nodes. The failure/recovery can result from a hardware failure or 
software upgrade, for example. The LSA storm can also be generated due to a near- 
synchronous refresh of large numbers of LSAs and to sudden bandwidth changes in virtual 
circuits in the network. 
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One problem associated with LSA storms is the loss of trunks due to excessively 
delayed processing of HELLO messages. As long as a trunk between neighboring nodes 
is considered up, HELLO messages are exchanged between the nodes over the trunk 
5 periodically with period T, which is typically between about 5 and 10 seconds. If one of 
the neighboring nodes does not receive a HELLO message for a predetermined number of 
consecutive times, e.g., four, the node declares the trunk to be down. 

During an LSA storm, HELLO messages can pile up until HELLO messages from 
10 neighboring nodes such that they may not processed in a timely manner. For example, 
HELLO messages are queued behind other control messages arriving at the node before 
the HELLO messages. Furthermore, if timer-triggered messages are served at a higher 
priority, then the HELLO messages also have to wait behind control messages triggered 
by the expiration of a timer. If the total waiting time of a HELLO message is longer than 
15 a specified duration nT, which is typically between 15 and 40 seconds, then the trunk will 
be declared down even though it is up. 

For example, a node having 50 trunks and a 1 millisecond processing time for 
receiving or transmitting a message over a trunk can experience a HELLO message 
20 queuing delay of about 15 seconds with an LSA storm of size 150, and a queuing delay of 
about 40 seconds with an LSA storm of size 400. The LSA storm size corresponds to the 
number of LSA messages in an LSA storm. If the processing time is doubled, e.g. 2 ms, 
then the same queuing delays would result from LSA storms half as large. 

25 Declaring a trunk down while it is actually up is disadvantageous for several 

reasons. Declaring the trunk down triggers the flooding of LSA messages to the entire 
area (or areas) in which the trunk is located. In addition, all Virtual Circuits (VCs) over 
the trunk are released and rerouted. Once the waiting time of a HELLO packet is over 
and the message is processed, the node may declare the trunk up causing possible further 

30 VC rerouting. Declaring trunks down while they are up also results in wasted bandwidth 
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and inefficient routing. Furthermore, erroneously declaring trunks down on a relatively 
large scale can cause the entire network (or area of the network) to enter an oscillatory 
state that can bring the network down. Thus, LS A storm effects are exacerbated by the 
very events of trunks going down and up. 

5 

A further disadvantage associated with conventional link state message processing 
is the occurrence of so-called LSA retransmission lockout in which the node processor 
enters a loop that processes only retransmissions and other timer-triggered messages. 
Thus, the node processor does not process HELLO, LSA and LSA acknowledgement 
10 messages arriving from other nodes while in the loop. LSA retransmission lockout can 
occur when timer-triggered messages are served at a higher priority than other messages 
and the timer-triggered messages are generated at a rate equal to or higher than the rate at 
which they can be processed by the node processor. 

15 LSA retransmission lockout typically results from a combination of events. There 

are generally three main types of timers: HELLO refresh timers, LSA refresh timers, and 
LSA retransmission timers. The rates of message generation due to the expiry of the 
HELLO and the LSA refresh timers are fixed and independent of network conditions 
(typically one HELLO message per 5 to 10 seconds per trunk and one LSA refresh every 

20 30 minutes per LSA originated by the node). Thus, these messages require only a 
relatively small fixed fraction of the node processing power. 

The rate of message generation due to the expiry of LSA retransmission timers is 
typically one message every 5 seconds per unacknowledged LSA. This rate depends upon 

25 the level of network congestion. Under normal operating conditions the rate of message 
generation is close to zero since very few LS As remain unacknowledged for more than 5 
seconds. However, under heavy network congestion generated by an LSA storm, it is 
possible for many LSAs to remain unacknowledged for more than 5 seconds due to 
congestion either at the transmitting node or at the receiving node, such that LSA 

30 retransmission lockout can occur. 



4 




Once a node processor enters a retransmission lockout state, it does not process 
any messages that are not triggered by a timer. This includes acknowledgements to earlier 
transmissions and retransmissions that would help the node processor to get out of the 
5 retransmission lockout state. Eventually the node processor can get out of the 
retransmission lockout since the LSAs being retransmitted age out. However, this 
happens after an unacceptably long time, e.g., one hour, before which the node typically 
goes down. 

10 It would, therefore, be desirable to provide a link-state network protocol that 

enhances the ability of a network to handle LS A storms. 

SUMMARY OF THE INVENTION 

The present invention provides a mechanism for link state network protocols to 

15 identify certain link state routing control messages, to store the identified messages in 
respective queues for each message type, and to process the messages in a weighted 
arrangement, such that each message type is allotted a predetermined amount of node 
processing power. For each visit to a particular message queue, the processing time can 
have an upper limit. By distributing node processing power to the various message types, 

20 the network reliability and scalability is enhanced as compared with conventional link state 
networks. Although the invention is primarily shown and described in conjunction with 
the Open Shortest Path First (OSPF) protocol, it is understood that the invention is 
equally applicable to other link state protocols, such as the Private Network-Network 
Interface (PNNI) protocol. 

25 

In one aspect of the invention, a communication network includes a plurality of 
areas in which network nodes are located. The network nodes identify certain link state 
routing control messages and store them in respective queues. In one embodiment, 
identified OSPF messages include HELLO messages, Link State Advertisement (LSA) 
30 messages, and LSA acknowledgement messages, each of which is sent from other nodes. 
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Certain timer-triggered messages generated by the node itself, such as HELLO refresh 
timer messages, LSA refresh timer messages, and LS A retransmission timer messages can 
also be identified. Each message type is stored in a separate queue to which weights are 
assigned for allotting a predetermined amount of processing power to each of the message 
5 types. In one embodiment, the message processing sequence is determined by weighted 
round robin processing of the message queues. 

In a further aspect of the invention, a method for processing control messages in a 
link state network includes identifying certain control messages and storing the identified 

10 messages in respective queues. The method further includes assigning a respective weight 
to each message type queue and processing the queued control messages in a sequence 
such that each message type is allotted a predetermined amount of processing time. In 
one embodiment, incoming OSPF messages identified by a node include HELLO, LSA, 
and LSA acknowledgement messages and identified self-generated messages include 

15 HELLO refresh, LSA refresh, and LSA retransmission messages. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will be more folly understood from the following detailed 
description taken in conjunction with the accompanying drawings, in which: 

20 

FIG. 1 A is a schematic representation of an exemplary network configuration 
having link-state message processing in accordance with the present invention; 

FIG. IB is a further schematic representation of an exemplary network 
25 configuration having link-state message processing in accordance with the present 
invention; 

FIG. 2 is a diagram showing an exemplary packet structure for carrying link-state 
messages in the network of FIGS. 1 A-B; 

30 
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FIG. 3 is a flow diagram showing an exemplary sequence of steps for identifying 
and storing certain link state control messages in accordance with the present invention; 

FIG. 4 is a flow diagram showing an exemplary sequence of steps for processing 
5 stored control messages in accordance with the present invention; and 

FIG. 5 is a pictorial representation of an exemplary sequence for processing 
messages stored in weighted queues in accordance with the present invention. 

10 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a network utilizing a link-state protocol in which 
network nodes identify certain types of control messages and store the messages in 
respective weighted queues such that a predetermined amount of processing power is 
allotted to each message type. During each visit to a message queue, there can be an 

15 upper limit on processing time for the message. 

FIGS. 1 A-B show an exemplary communication network 100 that processes 
certain Open Shortest Path First (OSPF) link-state protocol control messages in 
accordance with the present invention. FIG. 1 A shows link state advertisement (LSA) 
20 message flooding and FIG. IB shows HELLO message generation, in which like reference 
element indicate like elements. 

The network 100 includes four nodes Ni,N 2 ,N 3 ,N 4 that are interconnected by six 
trunks Ti,T 2 ,T 3 ,T 4 ,T 5 ,T 6 . The first trunk Ti connects the first and second nodes Ni,N 2 

25 and the second trunk T 2 connects the first and fourth nodes Ni,N 4 . The second and fourth 
nodes N 2 ,N 4 are interconnected by the fourth and fifth trunks T 4 ,T 5 . The sixth trunk T 6 
connects the third and fourth nodes N 3 ,N 4 . While the trunks T are shown as bi- 
directional, it is understood that the trunks can be uni-directional as well. It is further 
understood that the nodes N can be provided from a variety of devices including IP 

30 routers and ATM switches (using the PNNI protocol), for example. 
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As shown most clearly in FIG. 1 A, a single LSA message is generated by the third 
node N3 and is flooded to the rest of the network. LSA messages include original (or 
first-time) LSA messages Lo and duplicate LSA messages L D . In general, nodes receiving 
5 original LSA messages Lo respond with an acknowledgement message A and flood a copy 
of the original LSA message over other trunks and nodes. Duplicate LSA messages L D 
are disregarded. 



The third node N 3 sends the original LSA message Lo over the trunks T 3 ,T 6> 
interconnecting the third node with the neighboring second and fourth nodes N 2 ,N 4> 
respectively. The second node N 2 then sends an acknowledgement message A back to the 
third node N 3 over the third trunk T 3 and floods the LSA messages over interconnecting 
trunks Ti,T 4 ,T 5 to the first and fourth nodes Ni,N 4 and N 4 , respectively. The LSA 
message Lo to the first node Ni from the second node N 2 is a first-time LSA. The LSA 
messages L D to the fourth node N 4 from the second node N 2 over the interconnecting 
trunks T^Ts are duplicates since the fourth node N 4 previously received a first-time LSA 
message from the third node N 3 . 

The first node Ni sends an LSA acknowledgement message A to the second node 
20 N 2 over the connecting trunk Ti and floods an LSA message L D to the fourth node N 4 
over a trunk T 2 . The LSA message is a duplicate since the fourth node N 4 previously 
received a first-time LSA message Lo from the third node N 3 . The fourth node N 4 sends 
an acknowledgement message back to the third node N 3 over the trunk T 6 and floods LSA 
messages L D over interconnecting trunks T 2 ,T 4 ,T 5 to the first and second nodes Ni,N 2 and 
25 N 2 , respectively (T 4 and T 5 both connect N 2 and N 4 ). These LSA messages are all 

duplicates since the first and second nodes Nj,N 2 have already received first-time LSA 
messages Lo from other sources. A duplicate LSA message is an implicit 
acknowledgement such that no separate acknowledgement is necessary. 
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FIG. IB shows the exchange of HELLO messages H among the nodes N1-N4 in 
the network 100. Once every Th seconds (where Th is typically 5 or 10), each node Ni- 
N 4 sends a HELLO message over each trunk Ti-T 6 to the neighboring node on the other 
side of the trunk. 

5 

In accordance with the present invention, control messages are identified by the 
node and stored in separate message queues. Self-generated messages, such as timer- 
triggered messages, can be readily identified by the node. For messages from other nodes 
in the network, the recipient node can identify message types by examining specified 
10 portions of the packet structure. 

FIG. 2 shows an exemplary packet structure 150 for carrying link-state messages, 
e.g., LSA, LSA acknowledgement and HELLO messages. In one embodiment, the 
message type is identified by examining a field in a link-state protocol header 152 that 

15 uniquely identifies each message type. For example, the OSPF Packet Type field in the 
OSPF Protocol can be used for message type identification. For faster identification, the 
message type can be identified by examining a lower layer protocol header 154. For 
example, an unused portion of the Type of Service (TOS) field in the IP Packet header or 
an unused portion of the Virtual Circuit Identifier (VCI) field in the ATM Packet header 

20 can be used for this purpose. The packet body 156 contains the message. 

FIG. 3 shows an exemplary sequence of steps for identifying certain types of 
incoming OSPF control messages. In one embodiment, the OSPF messages that are 
identified include HELLO messages, LSA messages (original or duplicate), and LSA 
25 acknowledgements. Each of these incoming messages is transmitted by another node via a 
trunk. 

In step 200, a node in the network receives a message from another node and 
examines the message header in step 202 to determine whether the message is one of the 
30 specified routing control messages to be stored in a respective queue. In step 204, the 
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node determines whether the received message is a HELLO message. If the message is a 
HELLO message from another node, the node stores the message in the HELLO message 
queue Ql in step 206. 



the message is an LSA message. If the message is an LS A message, the node stores the 
LSA message in the LSA message queue Q2 in step 210. If the message is not an LSA 
message, in step 212 the node determines whether the received message is an LSA 
acknowledgement message, and if so, stores the LSA acknowledgement message in the 
10 LSA acknowledgement queue Q3 in step 214. 

As the node receives further messages from other nodes, the node identifies 
HELLO, LSA, and LSA acknowledgement messages and stores them in the 
corresponding message queues Q1,Q2,Q3. It is understood that during an LSA storm, the 
15 message queues store the received messages for ultimate processing by the node, as 
described in detail below. 

In addition to routing control messages sent by other nodes, the node identifies 
certain self-generated messages, such as timer generated messages. Exemplary self- 

20 generated control messages for storage and weighted processing include HELLO 

messages triggered by the HELLO refresh timer, LSA messages triggered by the LSA 
refresh timer, and LSA messages triggered by the LSA retransmission timer. It is 
understood that further messages can be identified and stored by the node, such as various 
control messages triggered by hardware-based timers. Since these messages are generated 

25 by the node itself, they are readily identified and stored in respective queues. 
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If the message is not a HELLO message, in step 208 the node determines whether 
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In one embodiment, six message queues Q 1-6 are formed as shown below in Table 

1. 

Table 1 
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The HELLO, LSA, and LSA acknowledgement messages are received from other nodes 
and can be identified by examining a field in the packet header, as described above. The 
remaining messages, i.e., the timer based messages, are generated by the node such that 
they are readily identified by the node. 

In general, the message queues Ql-6 are assigned a weight so as to apportion a 
predetermined amount of processing power, which can be based upon CPU cycles, to each 
of the queues. The queues are weighted to optimize message processing during LSA 
storms for mniimizing the likelihood that the network goes down due to a software 
upgrade or hardware failure, for example. 

It is understood that a variety of weighting schemes can be used to vary the 
processing time allotted to each of the queues. Exemplary schemes include weighted 
round robin and weighted fair queuing. 

20 FIG. 4 shows an exemplary sequence of steps for processing the queued control 

messages in a weighted round robin manner. In step 300, the relative weight desired for 
each message queue is determined. The relative weights are selected such that each 
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message type receives a selected processing rate even when one or more message queues 
are overloaded. It is understood that the weights can be multiplied by a constant such that 
each weight in an integer. In step 302, a round robin polling table is generated such that 

6 

the number of entries corresponds to the sum of the weights, i.e., = . Each 

i=i 

5 entry corresponds to one of the message queues, which has an associated weight. Unless, 
each queue is given a weigh of one, at least one message queue has multiple entries in the 
polling table as demonstrated in the exemplary polling table of FIG. 5. 

FIG. 5 shows an exemplary weighted round robin polling table 350, in which the 
10 sum of the queue weights corresponds to the number of entries in the polling table, e.g., 
Wi=W 3 =W 4 =2, W 2 =W 5 =W 6 =1, and W SU m= 9. That is, the weight assigned to Ql 
(HELLO), Q3 (LSA acknowledgement), and Q4 (HELLO refresh) is two and the weight 
assigned to Q2 (LSA), Q5 (LSA refresh), and Q6 (LSA retransmission refresh) is one. 
The sum of the weights assigned to the queues Q 1-6 is nine, which corresponds to the 
15 number of entries in the round robin polling table 350. The entries in the polling table are 
then positioned to minimize the distance between visits to a particular queue. More 
particularly, the entries corresponding to the same queue should generally be spread 
evenly throughout the polling table. It is understood that the position of the table entry 
determines the sequence in which the queues are served by the node processor. 



For example, the polling table 350 has two entries for Ql (HELLO messages) that 
should be spaced apart rather than placed together. More particularly, the maximum 



distance between successive visits to the polling table entries is 



W 

ff SUM 



As known to 



one of ordinary skill in the art, the f ] operator returns the smallest integer greater than or 
25 equal to the argument. Thus, the distance is five for queues Q1,Q3,Q4 having two entries 
and the distance is nine for queues Q2,Q5,Q6 having a single entry. The node can process 
the messages in the queues Q 1-6 in a predetermined direction, such as the clockwise 
direction indicated by arrow 352. 
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Referring again to FIG. 4 in conjunction with FIG. 5, in step 304, the node 
processor selects the first entry Ql in the round robin polling table 350. In step 306, the 
node processor determines whether there is a HELLO message in the HELLO queue Ql 
to be processed. If there is not a HELLO message in the HELLO queue Ql the next 
entry, i.e., Q3, in the polling table 350 is selected in step 312. If there is a message in the 
HELLO queue Ql, the node processor determines whether the processing time for the 
message is less than or equal to a maximum processing time Pmax in step 307. If so, the 
complete message is processed in step 308 and the next polling table entry is selected in 
step 310. 

If the message processing time is greater then Pmax as determined in step 307, a 
portion of the message up to Pmax is processed by the node processor in step 314. The 
next polling table entry is then selected in step 310. 

In this manner, each of the message queues in the polling table 350 is serviced in a 
predetermined order, i.e., Q1-Q3-Q4-Q2-Q5-Q1-Q3-Q4-Q6 and back to Ql. It is 
understood that the arrangement of the message queues that defines the processing 
sequence can be readily modified by one of ordinary skill in the art. 

This arrangement dedicates a predetermined portion of the node processing power 
to each message queue Ql-6. The portion assigned to message type / can be computed 

as: 



where Pj is the processing time for processing a message of type j assuming that Pj does 
not exceed an upper limit P,™, Wj is the weight assigned to message type j, and Wi is the 
weight assigned to message type i. 



WP 
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In addition, valuable processing power is not wasted when there are no messages 
in a given queue since the total routing processor power is distributed among the other 
message types in proportion to their relative weights. 

5 The weighted round robin processing of link-state control messages described 

above reduces or eliminates network failures due to excessively delayed HELLO messages 
and LSA retransmission lockout. By sufficiently weighting the HELLO and HELLO 
refresh messages queues Q1,Q4, the node processor can process these messages at a 
predetermined rate independent of the network congestion level so that a trunk is not 

10 declared down when it is actually up. 

In addition, retransmission lockout should not occur since even in the case where 
there is a relatively high number of messages to be retransmitted a fraction of the node 
processing power is allotted to messages that are not triggered by the expiry of timers, i.e., 
15 HELLO (Ql), LSA (Q2), and LSA acknowledgement (Q3) messages. Furthermore, even 
if a relatively large number of unacknowledged messages accumulate following a severe 
LSA storm, these messages dissipate relatively quickly since acknowledgments are 
processed at a predetermined rate regardless of the level of network congestion. 

20 It is understood that in addition to the processing of routing control messages as 

described above, the node processor provides other functionality as well, such as 
processing signaling messages, network control messages, and the like. In general, a 
fraction F of the total node processing power is dedicated for the processing of routing 
control messages. In one embodiment, a polling table for the node includes W RC + Wp 

25 entries of which W RC entries are dedicated to processing routing control messages and Wp 
entries are dedicated to other tasks. Thus, F=W rc P/(WrcP + WpP'), where P is the mean 
processing time of a single routing control message and P' is the mean processing time of 
a single message of the other tasks. 
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While the invention is primarily described in conjunction with OSPF, it is 
understood that the invention is equally applicable to PNNI in ATM and other link-state 
protocols. 

The below examples of conventional link state message processing and link state 
message processing in accordance with the present invention demonstrate how the 
invention overcomes some of the disadvantages of conventional message processing 
declaring trunks down even though they are actually up. 

EXAMPLE 1 - Conventional HELLO message processing 

An LSA storm of size S is generated within an area of the network as a result of 
the failure or recovery of a single trunk, a group of trunks, a single node or a group of 
nodes. Focusing on a node N within the area, let L be the number of trunks attached to 
node N, and P be the average processing time needed either to receive a message or to 
transmit it over a single trunk. HELLO messages are exchanged every T seconds over 
every trunk. If no HELLO message is received for n Hello intervals (i.e., a period of time 
nT) then the trunk is declared down. R is the LSA retransmission timer value, i.e., if no 
acknowledgment is received for an LSA transmitted to a neighboring node within a time- 
■ period R, then it is retransmitted. 

In a conventional link state network, for each LSA message in the LSA storm, 
node N will get one original message over one of the L trunks, acknowledge it, and flood 
duplicate LSA messages over all trunks except the one over which the LSA arrived. The 
total processing time needed for this work can be computed as the product of the number 
of trunks L multiplied by the processing time P, i.e., LP. In the worst case, the node will 
also receive duplicate messages from all other trunks as well and the total processing time 
for that work is also about LP. So the total processing time at node N to process all 
messages resulting from the storm can be expressed as 2SLP. Since the inter-arrival time 
between HELLO messages is relatively long (typically 5 to 10 seconds), it is possible for 
all the work ( about 2SLP) at node N to arrive between successive arrivals of HELLO 
messages over a certain trunk. In that case, assuming conventional first-come-first-served 
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processing, the waiting time for the second HELLO message would be about 2SLP. 
Therefore, the condition under which a link will be declared down (even when it is actually 
up) due to not processing HELLO messages is given by 

2SLP>nT (1), or 

5 S > (2) 

Assuming L = 50, P = 1 ms, n = 3 (n is the number of HELLO intervals) and T = 5 
seconds, the above condition is satisfied for any LSA storm of size S > 150. In addition, 
assuming L= 50, P = 1 ms, n - 4 and T= 10 seconds, the above condition is satisfied for 
any LSA storm of size S > 400. Also, from Equation (2) we see that if P is 2 ms instead 
10 of 1 ms then the values of S stated above would be 75 and 200 respectively. 

In the relatively simple analysis above, it should be noted that certain effects were 
not taken into account. For example, a portion of the total work of LSA processing 
would be done before the arrival of the second HELLO message. In addition, there would 
15 be other work resulting from other control messages that are not part of the LSA storm, 
either coming from a different node or triggered by the expiry of a timer. The above 
effects are opposing in nature and would partially cancel each other. 

EXAMPLE 2 - HELLO Message processing in accordance with the present invention for 
20 reducing or eliminating the chance of declaring trunks down even when 

thev are actually up. 
As described above, the fraction of the node processor dedicated to processing 

( 6 ^1 

Type / messages is FW i P i I ^W-Pj where F is the fraction of node processor 

U=i ) 

dedicated to routing control work, and Wj, Pj are the relative weight and processing time, 
25 respectively, for type / message. The rate at which type / messages may be processed is 



therefore FW i I 



Consider the guaranteed minimum rate at which HELLO 



messages can be processed. Note that the type 1 (Ql) message represents HELLO 
messages received from another node and the type 4 (Q4) message represents HELLO 
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messages generated by node N, i.e., HELLO refresh messages. The minimum processing 
rate occurs when the message processing times Pjs get their maximum possible values. 
Among the 6 message types, the processing times for types 1, 4 and 3 (HELLOs and 
acknowledgments to LS As) should be relatively small and unaffected by network 

5 congestion. Let their maximum value be P . The other three message types are LS As and 
may be large under network congestion and so we assume them to achieve the value P,^, 
the maximum allowed for any message type. Therefore, the guaranteed minimum rate for 

type / messages is FW i I ((W } + W 3 + W A )P + (W 2 + W 5 + W 6 )P fm ). Since node N has L 

trunks, the rate at which HELLO messages (either type 1 message or type 4 message) need 
10 to be processed at node N is (L/T). Therefore, HELLO messages are processed at or 
above the desired minimum rate if the following relation holds: 

For/=l and4, FWJ^W X +W, +^ 4 )P + (FF 2 +^ 6 )P max )> LIT (3) 

15 It should be noted that the above relationship is independent of the network congestion, 
i.e., the size of the "LSA storm." Let F=0.5, W, = W 3 = W 4 = 2, W 2 = W 5 = W 6 = 1 and 

A 

Wsum = 9, as described above in the polling table of FIG. 5, P-\ms- 0.001 sec, Pmax = 10 
ms = 0.01 sec, T =5 sec, and L = 50, the same values as used in conventional Example 1, 
above. 

20 

For the above parameter values, Equation (3) is satisfied so that HELLO messages 
are processed at the required rate regardless of the network congestion level. In fact, the 
condition continues to be satisfied for even a larger node connectivity, L, as long as the 
node connectivity does not exceed 138. By using larger values of Wj and W 4 , Equation 3 
25 can be satisfied for even larger node connectivity, unless the node connectivity gets so 
large that the node processor cannot keep up processing just the HELLO messages. 

EXAMPLE 3 - Conventional OSPF leading to LSA retransmission lockout 

Consider the same LSA storm example as in Example 1 . The LS As in the storm 
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originated by node N are transmitted over all L trunks and other LSAs are transmitted 
over all trunks except the one on which the message came. So, the total number of LSAs 
transmitted by node N is about SL. The maximum rate of LSA generation at node N due 
to retransmission timer expiry is about SLflL The maximum rate at which node N can 
5 process the retransmitted LSAs is (1/P). So, the condition for LSA retransmission lockout 
(i.e., the condition under which the node processor enters an infinite loop processing only 
retransmissions and nothing else) is given by 

fH Wot, 
10 S>% (5) 

Assuming L = 50, R = 5 seconds and P = 1 milliseconds, LSA retransmission lockout can 
occur for any storm of size S > 100. 

15 It should be noted that there are two opposing effects that have not been taken 

into account in the simple model above. The first effect is that, some of the LSAs are 
likely to be acknowledged before the onset of the first retransmission. The second effect 
is that besides the retransmitted LSAs considered above there are other control messages 
triggered by Hello refresh timers and LSA refresh timers. The first effect implies that the 

20 LSA storm size has to be bigger than what is shown in Equation (5) for the retransmission 
lockout to happen. The second effect implies that even an LSA storm of size smaller than 
what is shown in Equation (5) would cause retransmission lockout, or more generally, 
timer-triggered lockout. So the two effects are opposite in nature and tend to cancel each 
other. 
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EXAMPLE 4 - OSPF in accordance with the present invention reducing or eliminating 
retransmission lockout. 
As described above, the fraction of the node processor dedicated to processing 

/ 6 > \ 



type / messages is FW^ I 



and the guaranteed minimum rate at which 
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messages of type i are served is FW i I ((W, + W l + ^ )P + (fF 2 +W 5 +W 6 )P max ). There 
will not be a retransmission lockout since even if there are many messages to be 
retransmitted, a fraction of the node processing power is reserved for messages that are 
not triggered by the expiry of a timer, i.e., type 1, 2 and 3 messages. For 
acknowledgments to LSAs, i.e., / = 3, using the same parameter values as Example 2, the 
minimum guaranteed rate at which acknowledgments are processed is about 27.8 per 
second. So, even if a relatively large number of unacknowledged messages accumulate 
following a severe LS A storm, the unacknowledged messages are dissipated relatively 
quickly. Furthermore, the rate of dissipation can be increased further by increasing W 3 , 
the relative weight for the acknowledgment message queue Q3. 

One skilled in the art will appreciate further features and advantages of the 
invention based on the above-described embodiments. Accordingly, the invention is not to 
be limited by what has been particularly shown and described, except as indicated by the 
appended claims. All publications and references cited herein are expressly incorporated 
herein by reference in their entirety. 

What is claimed is: 
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